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© An expression system for producing and isolating large quantities of protein. This system employs an 
expression vector, comprising (a) a coding region for a glutathione-binding polypeptide (glutathione-s-transferase 
preferred), operatively connected to a promoter, (b) a second coding region in-frame with the first coding region, 
and (c) at least one restriction site between the first and second coding regions wherein a fusion protein of the 
first and second coding regions will result from expression of the vector. This vector is used in a host cell, which 
in turn is used in a process for isolating and purifying a protein. This process comprises (a) treating the host cell 
under conditions allowing expression of the vector, whereby a fusion protein of the first and second coding 
regions will be expressed; (b) exposing proteins from the host cell to glutathione resin, whereby the fusion 
protein will adhere to the resin; and (c) cleaving the expression product of the second coding region from the 
resin. Also described is a process for expressing a nucleic acid sequence, which comprises (a) inserting the 
nucleic acid sequence into a baculovirus expression vector in-frame with the first coding region; (b) placing the 
vector into a host cell; (c) treating the host cell under conditions allowing expression of the vector, resulting in 
expression of a fusion protein of the first coding region and the sequence inserted in step (a); (d) exposing 
proteins from the host cell to glutathione resin, whereby the fusion protein will adhere to the resin; and (e) 
treating the adhered fusion protein with a protease to release the expression product of the nucleic acid 
sequence from the resin. A baculovirus /Spodoptera frugiperda expression system is preferred. 
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The present invention relates to processes for expression of proteins and to xpr ssion vectors and 
host cells used ther for. 

The Ick gene product, p56^, is a member of the src family of protein tyrosine kinases. Cooper, J.A. 
(1990) in Peptides and Protein Phosphorylation (Kemps, B.E.. ed) pp. 85-113, CRC Press, Boca Raton, FL. 
5 The Ick protein is normally expressed in T lymphocytes and natural killer cells, where it likely performs a 
variety of functions relating to signal transduction through ligand binding to selected surface proteins. Bolen, 
J A, and Veillette, A. (1989) Trends Biochem. Sci . 14, 404- 407; Rudd, C.E. (1990) Immunol. Today 11, 
400-406. In T-cells, p56^ forms a non-covalent complex with the CD4 and CD8a. Veillette, A., Bookman, 
MA, Horak, E.M., and Bolen, J.A. (1988). For this reason, p56^ j S believed to aid in mediation of signals 

to emanating from the T-cell antigen receptor through ligation of CD4 or CD8 to non-polymorphic determinants 
on antigen-bearing major histocompatibility molecules. Shaw, A.S.. Chalupny, J. f Whitney, J.A., Hammond, 
C Amrein, K.E., Kavathas, P.. Sefton. B.M., and Rose. J.K., (1990) Mo). Cell. Biol. 10. 1853-1862; Doyle. C, 
and Strominger, J.L. (1987) Nature 330, 256-259; Norment. A.M., Salter, R.D., Parham. P., Engelhard, V.H., 
and Liftman. D.R. (1988) Nature 336, 79-81. More recently. p56^ has been implicated as a signaling 

is component of the high affinity interleukin-2 receptor. Hatakeyama, M., Kono, T.. Kobayashi, N., Kawahara, 
A., Levin, S.D., Perlmutter, R.M., and Tanaguchi, T. (1991) Science 252, 1523-1528. 

A better understanding of the structure and regulation of p56^ and similar proteins would clearly 
contribute to our knowledge of early signal transduction events and a source of large quantities of purified 
p56^ would be useful. While early analysis of p56^ functions have been greatly facilitated by antibodies 

20 directed against this protein, immunoaffinlty purification has been hampered by lack of an abundant source 
of enzyme. This difficulty has been addressed in part by baculovirus expression systems. Summers, M.D.. 
and Smith. G.E. (1987). A Manual for baculovirus vectors and insect cell culture procedures . Texas A&M 
bulletin No. 1555, (College Station. Texas Agricultural Experimental Station and Texas A&M University), 10- 
39. Recent studies using a baculovirus expression system have reported significant purification of p56^ 

25 using conventional chromotography methodologies. Ramer S.E.. Winkler. D.G., Carrera, A.. Roberts, T.M., 
and Walsh, C.T. (1991) Proc. Natl. Acad. Sci. USA 88, 6254-6258; Watts, J.D., Wilson. G.M., Ettehadieh, E., 
Clark-Lewis, I., Kubanek. C. Astell, C.R.. Marth, J.D., and Aebersold, R, (1991) J. Biol. Chem. 267, 901-907. 
While this approach results in purified enzyme, multiple column enzyme purification is costly, time- 
consuming, and requires large amounts of starting material. 

30 Glutathione-s-transferase (Gst) is a protein well known to bind to glutathione (Smith, D.B., and Johnson, 
K.S. (1988) Gene 67. 31-40). Glutathione resin may be used in column chromatography. The above 
baculovirus expression systems, however, do not employ Gst. 

The present invention relates to processes for expressing isolated forms of proteins and to expression 
vectors and host cells useful for such processes. In particular, this invention relates to an expression vector, 

35 comprising: 

(a) a first coding region, which codes for a polypeptide capable of binding to glutathione, operatively 
connected to a promoter, 

(b) a second coding region in-frame with the first coding region, and 

(c) at least one restriction site between the first and second coding regions; 

40 wherein a fusion protein of the first and second coding regions would result from expression of the vector. 
Vectors derived from baculovirus are preferred. 

Further in accordance with this invention is a host cell comprising such a vector. The preferred host cell 
is a Spodoptera frugiperda cell, particularly an Sf9 cell, although other host cells are suitable (see below). 
Such vectors and host cells are useful in a process for expressing a protein in isolated form, which 
45 comprises: 

(a) treating such a host cell under conditions allowing expression of the vector, whereby a fusion protein 
of the first and second coding regions will be expressed; 

(b) exposing proteins from the host cell to glutathione resin, whereby trie fusion protein will adhere to the 
resin; and 

so (c) clearing the expression product of the second coding region from the resin-bound fusion protein. 

Further in accordance with the present invention is a process for expressing a nucleic acid sequence, 
which comprises: 

(a) inserting the nucleic acid sequence into a baculovirus expression vector in-frame with a first coding 
region for a polypeptid capable of binding to glutathione, wherein the coding region is operatively linked 

55 to a promoter, 

(b) placing the vector into a host cell; 

(c) treating the host cell under conditions allowing xpr ssion of the vector, resulting in expression of a 
fusion protein of the first coding region and th sequ nee inserted in step (a); 
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(d) exposing proteins from the host cell to glutathione resin, whereby the fusion protein adh res to the 
resin; and 

(e) treating the adhered fusion protein with a protease to release the expression product of the nucleic 
acid sequence from the resin. 

5 For the first coding region, the inventors prefer a sequence encoding glutathione-s-transf erase 
(nucleotide SEQ. ID. NO.: 1; amino acid SEQ. ID. NO.: 2) or a fragment thereof capable of binding to 
glutathione. This system combines the high level expression of foreign proteins with baculovirus vectors 
(e.g., in Sf9 cells) and the ability of Gst fusion proteins to bind to glutathione resin. Treatment of the 
glutathione-binding fusion protein with a proteolytic substance such as thrombin can thus liberate the 

70 desired protein from the glutathione-binding portion of the fusion protein. The glutathione-binding portion 
remains bound to the resin, thus purifying the desired protein. 

This expression system presents advantages over other systems, because it allows the practitioner (1) 
to produce large quantities of protein, (2) to purify significant amounts of active protein by a single 
chromatography step, (3) to use a wide range of extraction conditions, including non-denaturing detergents 

T5 to maintain protein function, (4) to use anti-Gst antibodies, allowing for screening of recombinant 
bacubviruses that express cloned sequences to which antibodies have not been generated or proteins 
whose function can not be measured, (5) to use a multiple cloning site with many restriction sites for 
convenient ligation, and (6) to use and/or study thrombin because it includes a thrombin cleavage site. 
The following definitions apply to the terms as used throughout this specification, unless otherwise 

20 limited in specific instances. 

The term "fusion protein" refers to a protein or polypeptide that has an amino acid sequence having 
portions corresponding to amino acid sequences from two or more proteins. The sequences from two or 
more proteins may be full or partial (i.e.. fragments) of the proteins. Such fusion proteins may also have 
linking regions of amino acids between the portions corresponding to those of the proteins. Such fusion 

25 proteins may be prepared by recombinant methods, wherein the corresponding nucleic acids are joined 
through treatment with nucleases and ligases and incorporated into an expression vector. Preparation of 
fusion proteins is generally understood by those having ordinary skill in the art. 

The phrase "polypeptide capable of binding to glutathione" refers to proteins, protein fragments, and 
synthetic polypeptides capable of binding to glutathione. Examples include glutathione-s-transferase and 

30 fragments thereof. Suitable fragments may be generated by gene amplification using 5* and 3' primers 
before translation or by proteolytic cleavage (see Table 1) after translation. 

The term "coding region" refers to an open reading frame; i.e., a portion of a nucleic acid that has a 
sequence that would be translated to form a sequence of amino acids. The term "coding region" includes 
sequences of naturally occurring proteins as well as sequences resulting from modifications (insertions, 

35 deletions, mutations, disruptions) obtained through recombinant methods. 

The term "(inking region" refers to a sequence of amino acids between coding regions from different 
sources in a fusion protein. Typically, linking regions encode sites recognized by proteases and thus allow 
the expression products of the coding regions to be separated from each other. 

The phrase "operatively linked to a promoter" means that the promoter is capable of directing the 

40 expression of the associated coding region. Coding regions for the fusion protein may also be operatively 
linked to other regulatory elements, such as enhancers. 

The preferred embodiment employs a Gst sequence within commercially available expression vector 
pGEX-2T. This sequence is derived from Schistosoma japonicum . A number of species are known to 
produce active isoforms of Gst, all of which are useful in the present invention. 

45 Coding regions for the fusion protein may be spliced into an expression vector by means well 
understood by those having ordinary skill in the art. Suitable expression vectors may be constructed using 
standard recombinant DNA techniques known in the art, many of which are described in Sambrook, et al. , 
Molecular Cloning; A Laboratory Manual . Second Edition, Cold Spring Harbor Laboratory. Cold Spring 
Habor. NY (1989). 

so Suitable expression vectors in accordance with the present invention comprise a coding region for a 
polypeptide capable of binding to glutathione, along with an in-frame sequence for the protein to be 
isolated. The coding region for the protein to be isolated may be located upstream or downstream of the 
coding region for the glutathione-binding polypeptide. Preferred are expression vectors comprising one or 
more regulatory DNA sequenc s operativ ly linked to the DNA sequence coding for all or part of Gst. 

55 Expression vectors useful in th pr sent invention typically contain an origin of replication, a promoter 
located 5' to (i. ., upstream of) th Gst fusion protein sequenc . which is followed by downstream 
transcription t rmination sequences, and th remaining vector. Control regions derived from a number of 
sources may b mployed in accordanc with th present invention. Suitable origins of replication include, 
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for example, the Col El, the SV40 viral and the M13 orgins of replication. Suitable promoters include, for 
example, the cytomegalovirus promoter, the lac Z promoter, the gal 10 promoter and the Autographa 
califomica multiple nuclear polyhedrosis virus (AcMNPV) polyhedral promoter. Suitable termination se- 
quences include, for example, SV40, lac Z and AcMNPV polyhedral polyadenylation signals. An expression 

5 vector as contemplated by the present invention is at least capable of directing the replication, and 
preferably the expression, of the nucleic acids encoding the fusion proteins. 

The expression vectors may also include other DNA sequences known in the art; for example, stability 
leader sequences which provide for stability of the expression product; secretory leader sequences, which 
provide for secretion of the expression product; sequences that allow expression of the structural gene to 

70 be modulated (e.g., by the presence or absence of nutrients or other inducers in the growth medium); 
marking sequences, which are capable of providing phenotypic selection in transformed host cells (e.g., 
genes for neomycin, ampiciliin, and hygromycin resistance and the like); and sequences that provide sites 
for cleavage by restriction endonucleases. All of these materials are known in the art and are commercially 
available. 

75 The characteristics of the actual expression vector used must be compatible with the host cell to be 
employed. The vector thus may include sequences which allow expression in various types of host cells, 
including but not limited to prokaryotes, yeasts, fungi, plants and higher eukaryotes. For example, when 
expressing DNA sequences in a mammalian cell system, the expression vector should contain promoters 
isolated from the genome of mammalian cells, (e.g., mouse metallothionien promoter), or from viruses that 

20 grow in these cells (e.g., baculovirus promoter, vaccinia virus 7.5 K promoter). 

Suitable commercially available expression vectors into which DNA sequences for the fusion proteins 
may be inserted include the mammalian expression vectors pcDNAI or pcDNA/Neo, the baculovirus 
expression vectors pBlueBac and pVL1393 (which is preferred), the prokaryotic expression vector pcDNAII 
and the yeast expression vector pYes2, all of which may be obtained from Invitrogen Corp., San Diego, CA. 

25 Preferred are commercially available vectors that already have Gst sequences included, such as pGEX-2T. 
The present invention additionally concerns host cells containing an expression vector that comprises a 
DNA sequence coding for a Gst fusion protein. The host cells preferably contain an expression vector which 
comprises all or part of the DNA sequence for the protein to be isolated together with a DNA sequence for 
a polypeptide capable of binding glutathione. See, for example, the expression vector appearing in the 

30 Experimental Procedures hereinbelow, which is preferred. Further preferred are host cells containing an 
expression vector comprising one or more regulatory DNA sequences capable of directing the replication 
and/or the expression of and operatively linked to a DNA sequence coding for all or part of the fusion 
protein. Suitable host cells include both prokaryotic and eukaryotic cells. Suitable prokaryotic host cells 
include, for example, E. coli strains HB101. DH5o, XL1 Blue, Y1090 and JM101. Suitable eukaryotic host 

35 cells include, for example, Spodoptera frugiperda insect cells (which are preferred), COS-7 cells, human 
skin fibroblasts, and Saccharomyces cerevisiae cells. 

Expression vectors may be introduced into host cells by various methods known in the art. For 
example, transection of host cells with expression vectors can be carried out by the calcium phosphate 
precipitation method. However, other methods for introducing expression vectors into host cells, for 

40 example, electroporation. liposomal fusion, nuclear injection, and viral or phage infection can also be 
employed. 

Once an expression vector has been introduced into an appropriate host cell, the host cell may be 
cultured under conditions permitting expression of large amounts of the fusion protein. 
Figure 1 : Construction of pBMS-l 
45 A. Outline of the cloning procedure. The glutathione-s-transferase gene was cloned into the Bam H-1 

site of the Sf9 expression vector pVL1393 to make the Gst fusion expression vector pBMS-1. The 
restriction map of the pBMS-1 polylinker, and the thrombin cleavage site are shown. 
B. Schematic of the GstLck fusion junction . Ick was joined to the Gst coding sequence using a Stu-1 
site located 24 base pairs upstream of the Ick inflation methionine codon. ~~ 
so Figure 2: Analysis of GstLck purified from Sf9 cells. 

A - SDS-PAGE analysis and Coomassie staining pattern . Lane 1 shows the result from 50 ug of total 
protein from infected Sf9 cells; lane 2, 1 ug of purified GstLck; lane 3. 0.5 ug of thrombin-cleaved 
GstLck (recombinant p56^). 

B. SDS-PAGE analysis of autophosphorylated GstLck . Lane 1 shows the result from autophosphoryla- 
55 tion of GstLck; lane 2, autophosphorylation of recombinant p56^. 

C Western blot analysis of th sample used in panel B using a polyclonal rabbit anti-lck antibody . 
Lane 1 shows th result from GstLck; Lane 2, recombinant p56!^. 
Figure 3: Autophosphorylation of GstLck. 
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A. Western blot analysis of p56 ^. Lan 1 shows the r suit from immunoprecipitated p56^ from OEM* 
6 cells; lanes 2-4, GstLck from infected Sf9 cell lysat s purified using the following methods. Lane 2. 
immunoprecipitation using anti-lck polyclonal antibodies; lane 3, immunoprecipitation using anti-Gst 
polyclonal antibodies; lane 4, affinity purification using glutathione resin. 
5 B. Analysis of the enzymatic activity of p56^r GstLck purified as outlined in panel A . Activity was 

assessed by autophosphorylation. The same protein samples and quantities were loaded as in panel 
A. 

Figure 4: Phosphorylation of enolase by GstLck. 

A. Phosphorylation of enolase as a function of GstLck concentration . Each reaction was carried out for 
to 1 minute at 30 *C, with 3 ug of enolase as substrate, and varying amounts of GstLck. Lane 1 shows 

the result from 0 ug GstLck; Lane 2, 0.04 ug GstLck, lane 3. 0.08 ug GstLck; lane 475.12 ug GstLck; 
lane 5. 0.2 ug GstLck; lane 6, 0.28 ug GstLck; lane 7, 0.36 ug GstLck; lane 8, 0.44 ug GstLck; lane 9, 
0.52 ug GstLck. ~ 

B. Time course of enolase phosphorylation by GstLck . Each reaction was carried out at 30 *C, with 
T5 0.4 ug of GstLck, and 3 ug of enolase as substrate. Lane 1 shows the result after 0 minutes; lane 2, 

0.5 minute; lane 3, 1 minute; lane 4, 2 minutes; lane 5, 3 minutes. 
Figure 5: Phosphorylation of enolase by thrombin-cleaved GstLck. 

A - Phosphorylation of enolase as a function of recombinant p56 l ^concentration . Each reaction was 
carried out for 1 minute at 30 # C, with 3 ug of enolase as substrate, and varying amounts of 
20 recombinant p56^. Lane 1 shows the result from 0 ug p56^; lane 2, 0.01 ug p56^; lane 3, 0.02 ug 

p56^; lane 4, 0.03 ug p56^;lane 5, 0.05 ug p56^; lane 6, 0.07 ug p56^; lane 7, 0.09 ug p56^ 
lane 8, 0.11 ug p56^. 

B - Time course of enolase phosphorylation by recombinant p56 ^. Each reaction was carried out at 
30 'C, with 0.01 ug of recombinant p56^, and 3 ug of enolase as substrate. Lane 1 shows the result 
25 after 0 minutes; lane 2, 0.5 minutes; lane 3, 1 minute; lane 4, 2 minutes; lane 5, 3 minutes. 

Experimental Procedures 

Construction of p56^* expression vectors. A Stu-1 fragment from the mouse Ick cDNA (Marth, J.D., 
30 Peet. R„ Krebs, E.G., and Perlmutter, R. (1985) Cell 43. 393-404) was cloned into the fiiled-in Eco-R1 site of 
the vector pGEX-2T (Pharmacia). The resulting plasmid pGEX-Ick is capable of expressing a glutathione-s- 
transferase/Lck (GstLck) fusion protein when transfected into E. coli cells. The GstLck coding sequence 
from pGEX-Ick was amplified by PCR. The 5* PCR primer & 

5' TAT AAA TAT GTC CCC TAT ACT A 3' 
(SEQ. ID. NO.: 3), 

40 was synthesized on an Applied Biosystems, inc. model 380A synthesizer. This primer hybridizes to the 5' 
region of the Gst coding sequence and encodes the ribosome binding site for the baculovirus polyhedrin 
gene. The 3* PCR primer, 

45 5' CGT CAG TCA GTC ACG AT 3' 

(SEQ. ID. NO.: 4), 

hybridizes to sequences immediately 3' to the polylinker of pGEX-2T. This primer pair can be used to 
so amplify any sequence cloned into the polylinker of pGEX-2T as a Gst/insert fusion. The amplified GstLck 
coding sequence was cloned into the vector pCRIOOO (InVitrogen, inc.) resulting in the plasmid pCR100f> 
GstLck. The pCRIOOO vector was designed for easy cloning of PCR-amplified DNA. and was used as an 
intermediate cloning vector. A Not-1, Bgl-ll fragment from pCR1000-GstLck containing GstLck coding 
sequence was cloned into the Not-I, BgHI sites of pVL1393. Lukow, V.A., and Summers. IvLD (1988) 

55 Virology 167, 56-71. Th resulting plasmid. pVL1393-GstLck (A.T.C.C. Accession No. , American Type 

Culture Collection, 12301 Parklawn Driv , Rockville, Maryland 20852-1776) was used to produce a 
recombinant baculovirus in Spodoptera frugiperda 9 (Sf9) cells following standard procedures. Summers, 
M.D., and Smith, G.E. (1987). A Manual for baculovirus vectors and insect cell culture procedur s , T xas 
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A&M bulletin No. 1555, (College Station, Texas Agricultural Experimental Station and T xas A&M Univer- 
sity), 10-39. The cloning scheme used for the construction of pBMS-l is outlined in figure 1A. The PCR 
primers used are the same described above. 

Purification of GstLck from Sf9 cells. A 500 mL spinner culture of infected Sf9 cells in Excel I-4O0 

5 medium (JRH Biosciences) was harvested 48 hours after infection by centrifugation at 4 • C for 5 minutes. 
The cells were lysed in 50 mL of cold 50 mM Tris pH 8.0, 150 mM NaCI. 2 mM EDTA, 1 mM DTT, 1%- 
(vol/vol) NP-40. 1 mM PMSF, 0.1 mg/mL aprotinin, 0.1 mg/mL leupeptin, 1 mM NaF, and 1 mM NaaVO* - 
(lysis buffer). Insoluble materiaJ was removed by centrifugation at 10.000 x g for 10 minutes at 4 # C. The 
resulting cell lysate was determined to have a protein concentration of 9.5 mg/mL using the Coomassie 

io Protein Assay Reagent (Pierce). 

The GstLck protein was purified by a one-step affinity chromatograpy procedure using glutathione resin 
as described by the manufacturer (Pharmacia). For this experiment, 50 mg of Sf9 cellular lysate containing 
the GstLck protein was added to a 2-mL glutathione column and the unbound material removed by washing 
with 50 mL of lysis buffer. Bound proteins were eluted from the column with 2 column volumes of lysis 

75 buffer containing 5 mM glutathione. Eluted protein was diluted to 15 mL with lysis buffer and concentrated 
using a Centriprep 30 Concentrator unit (Am icon, Inc.). Two additional dilutions and concentrations were 
performed to remove the remaining glutathione. The concentrated protein was adjusted to 10% glycerol and 
stored at -70 *C. This procedure yielded 28.0 mg of greater than 99% pure GstLck as determined by SDS* 
PAGE and Coomassie Blue staining analysis. 

20 To obtain p56^ protein lacking the Gst peptide sequences, GstLck was digested with the proteolytic 
enzyme thrombin to generate cleaved p56^ (cp56^. For this procedure 5 mg of thrombin was added to 
20 mg of purified GstLck in a volume of 50 mL lysis buffer, containing 2.5 mM CaCfc for 1 hour at 25 *C. 
To remove uncleaved GstLck and cleaved Gst. the products were mixed with 20 mL of glutathione resin. 
The glutathione resin was removed by centrifugation leaving the cp56^ in the supernatant. The yield from 

25 this procedure was approximately 5 mg of recombinant p56^ which was stored in 10% glycerol at -70 * C. 

Immune-complex protein kinase assays. Analysis of protein kinase activity conducted on immune- 
complexes was carried out as previously described. Veiilette. A.. Horak, I.D., Horak, E.M., Bookman. M.A., 
and Bolen, J A (1988) Mol. Cell. Biol. 8, 4353-4361. Briefly, immune-complexes formed from cellular 
lysates and the indicated antisera were collected by the addition of formalin-fixed Staphylococcus aureus - 

30 (Pansorbin, Calbiochem) and washed extensively in lysis buffer. Protein kinase reactions were initiated by 
the addition of 30 mL kinase buffer (20 mM MOPS pH 7.5 mM MnCfe. 1 mM ATP) containing 12.5 uCi [ Y - 
32 PJ-ATP (3000Ci/mmol, New England Nuclear). The reactions were allowed to proceed for 5 minutes at 
room temperature and stopped by addition of an equal volume of 2X SDS loading buffer (0.125 M Tris-HCI 
pH 6.8, 4% (weight/vol) SDS, 20% (vol/vol) glycerol, 10% (vol/vol) 2-mercaptoethanol). The phosphorylated 

35 products in SDS loading buffer were heated for 5 minutes at 90 *C and analyzed by SDS-PAGE and 
autoradiography. The ^P-labeled bands of interest were excised from the gel and counted in a Beckman 
LS6000TA liquid scintillation counter. 

Soluble protein kinase assays. The enzymatic activity of GstLck and cp56^ were evaluated by their 
capacity to phosphorylate the Lck exogenous substrate rabbit muscle enolase (Sigma). To determine the 

40 time course of enolase phosphorylation. 3 ug of GstLck or 1 ug of cp56^ was added to 100 ul of kinase 
buffer containing 12 ug enolase and 25 uCi [y-^P] ATP and the reactions were conducted at 30 'C for the 
indicated times. At each point, 10 uL of the reaction mix was removed, added to 30 uL of 2X SDS loading 
buffer and heated for 5 minutes at 90 *C. The reaction products were analyzed by SDS-PAGE and 
autoradiography. The bands corresponding to enolase were excised from the gel and counted by liquid 

45 scintillation spectroscopy. To determine the Km for enolase, serial dilutions of enolase were added to kinase 
buffer containing 5 uCi [t^PJ-ATP, and either 0.1 ug of GstLck or 0.01 ug of cp56^ were added per 
reaction. Reaction conditions and the counts incorporated into enolase were determined as described 
above. For the Km determination of ATP, a 1:10 dilution of fr-^PJ-ATP was added to kinase buffer 
containing 3 ug enolase. For each ATP dilution, 1 ug of cp56^ was added in a total volume of 30 uL and 

so reacted for 30 seconds at 30 *C. Reactions were stopped by addition of 30 uL of 2X SDS loading buffer 
and heated to 90 *C. The reaction products were analyzed by SDS-PAGE, the phosphorylated proteins 
visualized by autoradiography, and ^P incorporation determined by liquid scintillation spectroscopy of the 
excised bands. 

Other biochemical assays and materials. Lck immunoblot analysis was conducted as previously 
55 d scribed using rabbit anti-Lck antis ra Veiilette, A., Bookman, M.A., Horak, E.M.. and Bolen, J.A. (1988) 
Cell 55. 301-308. Partial proteolytic peptid analysis using Staphylococcus aureus V8 protease (Pierce) has 
also been pr viously described. V illett , A.. Horak. I.D.. Horak. E.M., Bookman. M.A., and Bolen. J.A. 
(1988) Mol. Cell. Biol. 8, 4353-4361 ; Marth, J.D., Cooper, JA, King, C.S., Ziegler, S.F., Tinker, D.A, Overell, 
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RA Krebs, E.G., and Perlmutter, R.M. (1988) Mol. Cell. Biol . 8. 540-550. Th human T-cell lymphoma cell 
line CEM was grown In RPMI 1640 m dia supplem nted with 10% (vol/vol) fetal bovine serum and 
antibiotics (penicillin/streptomycin). For immunoprecipitation experiments, the cells were washed in phos- 
phate buffered saline, collected by centrifugation. lysed in lysis buffer, and adjusted to 1 mg/ml prior to 
5 addition of anti-Lck antisera. Antisera directed against Gst was prepared by immunization of rabbits with 
purified Gst. Antisera directed against Lck amino acids 39-58 has been previously described. Veillette, A., 
Bookman. MA, Horak, E.M., and Bolen, J A (1988) Cell 55, 301-308. 

Results 

70 

Construction of expression vectors. Figure 1A outlines the cloning strategy used to create the 
expression vector pBMS-l. The Gst coding sequence from pGEX-2T was cloned by PCR amplification, and 
ligated into the baculovirus expression vector pVL1393. The 5' PCR primer was designed to optimize 
translation of the Gst coding sequence in Sf9 cells. This was accomplished by changing the sequence 

is surrounding the initiation methionine of Gst to encode the ribosomal binding site of the baculovirus 
polyhedrin gene. The pBMS-l polylinker contains 9 unique cloning sites, and can be used to make a 
recombinant baculovirus that expresses inserts as a Gst fusion protein in Sf9 cells. 

The fusion junction of the GstLck coding sequences cloned into pVL1393 is schematically shown in 
figure 1B. The thrombin cleavage site is also indicated. This plasmid pVL1393-GstLck was used to make a 

20 recombinant baculovirus that expressed high levels of the GstLck fusion protein in Sf9 cells. Thrombin 
cleavage of GstLck protein resulted in a recombinant p56^ (cp56^) molecule containing an additional 13 
amino acids at the Lck amino-terminus. These additional amino acids had no apparent affect on the in vitro 
enzymatic activity of recombinant p56^. This was determined by comparing the immune-complex protein 
kinase activities of cp56^ with that of wild-type p56^ expressed in Sf9 cells. 

25 Purification of GstLck from Sf9 cells. Total detergent lysates were made from Sf9 cells expressing 
the GstLck fusion protein as outlined in Experimental Procedures. Lysate containing GstLck was bound to a 
glutathione-sepharose column and eluted with 5 mM glutathione in lysis buffer. The glutathione-bound 
products from this column were analyzed by Coomassie staining following fractionation on SDS 
polyacrylamide gels. As shown in figure 2A, a single polypetide of approximately 83 kDa was observed 

30 which corresponds to the expected size for the GstLck fusion protein. Following thrombin cleavage (figure 
2A, lane 3), the recombinant Lck protein was observed to migrate as two closely spaced bands at 
approximately 56 kDa. 

Functional analysis of GstLck and cp56^. To evaluate the kinase activity of the purified GstLck and 
C p56^ proteins, protein kinase assays were performed. The results of these reactions (figure 2B) 

35 demonstrated that purified GstLck and cp56*£* maintained their autophosphorylation capacity. As expected, 
no kinase activity was detected in purified preparations of Gst. The data shown in figure 2C represents the 
corresponding Lck immunoblot using polyclonal rabbit antibodies against the p56^ unique region. Based 
on the relative amounts of Lck protein detected in the kinase reactions, it appears that the specific activity 
of the cp56!^ may be slightly higher than that of the GstLck fusion protein. Anti-phosphotyrosine 

40 immunoblot analysis of similar reaction products generated using non-radioactive ATP demonstrated that 
the autophosphorylation products (as well as the phosphorylation of exogenous protein substrate enolase 
used in other experiments) were phosphorylated on tyrosine residues. Additionally, partial V8 peptide 
analysis of the autophosphorylation products of the GstLck and cp56^ reactions yielded major V8 
phosphopeptides indistinguishable from that of T-cell derived p56^ autophosphorylated in immune-complex 

45 kinase assays. 

The level of GstLck enzymatic activity was also compared to that of wild type p56^ immunoprecipitat- 
ed from T-cell detergent lysates. For these experiments, GstLck was precipitated from infected Sf9 
detergent lysates with anti-Lck antisera. anti-Gst antisera, or with glutathione-Sepharose beads. The p56^ 
from T-cell lysates was immunoprecipitated with anti-Lck antisera. The various complexes were washed 

so extensively with lysis buffer and divided into two equal aliquots. One aliquot was used to perform protein 
kinase assays (figure 3B) while the other aliquot was used for Lck immunoblot analysis (figure 3A). The 
results of this experiment demonstrate that precipitation of the GstLck protein using either antibodies or 
glutathione beads yielded molecules with similar specific activities as assessed by autophosphorylation. 
Comparison with p56^ derived from T-cells showed that th specific activity of the Sf9 derived GstLck 

55 protein was significantly higher. ~ — 

To furth r charact rize the kinetic parameters of GstLck and cp56^, kinase activity of the fusion protein 
and cleaved enzym was studied using rabbit muscle enolase as an xogenous substrate. As shown by the 
data presented in ftgur 4, th phosphorylation of enolase by GstLck was found to be both tim and 
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concentration dependent. Similar results w re obtained for cp56^ (figure 5). The Km and values for 
ATP and enolase w re det rmined using a reaction time of 30 seconds and the results summarized in 
Table I. The affinity of cp56^ for enolase was found to be approximately 10-fold higher then that of GstLck. 
More critically the K„, and values determined for cp56^ are comparable to values obtained for other 
src family members. 

Attempts to produce functional GstLck in E. coli were unsuccessful. The resulting fusion protein was 
expressed, but it lacked detectable protein kinase activity and was found to be insoluble in detergents. The 
latter feature is common to expression of many eukaroytic proteins in bacteria. Marston, A.O. (1986) J. 
Biochem. 240, 1-12; Miller. D.W., Saher, P.. and Miller, L.K. (1986) in Genetic Engineering , vol. 8, pp. 277^ 
298, Plenum, New York; Miller. L.K. (1989) in Ann. Rev. Microbiol. 42. 177-199. Among the advantages of 
expression of eukaryotic proteins in Sf9 cells is the capacity of these cells to allow protein folding and post- 
translational modification that maintain protein solubility. In the case of Lck, expression of the wild-type 
p56^ in Sf9 cells using conventional baculovirus expression vectors has shown that Lck is myristylated and 
phosphorylated on serine and threonine residues. Thomas. J.E., Soriano, P., and Brugge, J.S.. (1991) 
Science 254..568-571. Since Lck in this system is expressed as a fusion protein with Gst at the 
aminoterminus, it is unlikely that myristylation occurs. We have not determined whether the GstLck is 
phosphorylated on serine or threonine residues. 

Discussion 

The lck coding sequences were ligated downstream from the Gst coding region in-frame to yield a 
plasmid capable of encoding a Gst-p56^ fusion protein. The p56^ produced in this manner was found to 
be a highly active protein kinase, and exhibited the expected biochemical properties of a member of the src 
family. 

Analysis of both the GstLck fusion protein as well as the cp56^ indicated that each retained significant 
protein tyrosine kinase activity as measured by autophosphorylation and tyrosine phosphorylation of the 
exogenous substrate rabbit muscle enolase. Importantly, the Gst sequences, whether fused to Lck or 
following cleavage from the kinase with thrombin, were not phosphorylated in immune-complex kinase 
assays or in kinase assays conducted in solution. Both the GstLck and the cp56^ were found to have 
substantially higher specific activities than p56^ derived from T-cells when measured by immune-complex 
protein kinase assays. The altered specific activity is likely to be the result of diminished carboxy-terminal 
tyrosine (tyrosine 505) phosphorylation for Lck in Sf9 cells although we have not determined the 
phosphorylation sites of Lck in these cells. Veillette. A., Horak. I.D.. Horak. E.M., Bookman. M.A.. and Bolen. 
JA (1988) Mol. Cell. Biol. 8, 4353-4361; Marth, J.D., Cooper. J A, King, C.S., Ziegler. S.F., Tinker, D.A.! 
Overell, RA, Krebs, E.G.. and Perlmutter, R.M. (1988) Mol. Cell. Biol. 8. 540-550. The lack of tyrosine 505 
phosphorylation of Lck, like that observed with Sf9-derived ppW^ (Morgan, D.O.. Kaplan, J.M., Bishop. 
J.M., and Varmus, H.E. (1989)Cell 57, 775-786). is probably attributable to the absence of expression of 
other tyrosine protein kinases such as Csk that are thought to phosphorylate the Src class of kinases at this 
site. Okada, M., and Nakagawa, H. (1989) J. Biol. Chem. 264, 20886-20893; Okada. M.. and Nakagawa, H. 
(1988) Biochem. Biophys. Res. Commun. 154, 796-802. 

From 50 mg of total Sf9 protein lysate, the foregoing procedure purified 280 mg of greater than 99% 
pure (by silver and Coomassie staining) recombinant p56^. From one liter of infected Sf9 cells, this system 
produced approximately 8-10 mg of purified recombinant Lck. 

The foregoing procedures were also used to produce GstLynB, GstSyk. GstBlk. GstFyn, and GstVes 
fusion proteins with comparable results and yields to that reported here for Lck. 
The abbreviations used throughout this specification are defined as follows. 



ATP 


adenosine triphosphate 


DNA 


deoxyribonucleic acid 


DTT 


dithiothreitol 


MOPS 


(3-[N-morpholino]propanesurfonic acid) 


PCR 


polymerase chain reaction 


PAGE 


polyacrylamide gel electrophoresis 


PMSF 


phenylmethylsulfonyl fluoride 


SDS 


sodium dodecyl sulfate 



The gene for GST can be cleaved by enzymes at the positions shown in Table 1. Such nucleic acid 
fragments can be used to generate partial Gst polypeptides in the fusion proteins of the present invention. 
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Table 1 


n 


ECONl 


I J 


3 fell 


13 


BsiYl 


13 


Bsll 


13 


Mael 


13 


Rmal 


17 


BsmF 1 


26 


EcoRl* 


26 


Tsp509 1 


29 


Msel 


33 


ASUl 


33 


BSlZl 


33 


C£xl3I 


33 


Drall 


33 


ECOU1091 


33 


NsplV 


33 


Sau96X 


1C 

J3 


BsuRl 


*>5 
J3 


oaexxl 


*»s 


Pal 1 
rail 


J D 


cSSl 


CI 
JX 


Taqi 


SI 


X XJlflPOX 


03 


BCQx 




rami 1 nil 
r Affix lU^l 


AO 


ftoll 


AO 

ou 


KSpOjZl 


03 


Mboll 


qc 


wax i 


0*7 


MOO 11 


102 


ninox 


1 ft*> 

1UZ 


BlnPll 


1 ft*? 
XUZ 


HlnPl 


1 Ail 

1U4 


AccXl 


1 04 


BeTnl 1 
aStUtJD X 


1 04 


O5703UX 


1 04 


oStUX 


104 


l»XOX 


104 


c nuuxx 


104 


Hhal 


104 


Mvnl 


104 


Thai 


121 


Acil 


124 


Bphl 


139 


EcoRl* 


139 


Tsp509 1 


154 


Mboll 


188 


Msel 


190 


EcoRl* 


190 


Tsp509 1 


193 


Rphl 


193 


MSel 


205 


BsaJU 


206 


Cfrl 


206 


Eaej 


208 


Ball 


206 


BsuRl 


208 


Haelll 



208 


Mscl 


208 


Pall 


216 


Maell 


226 


Alul 


239 


Afllli 


243 


Nlalll 


243 


Nsd7S241 


243 


NspHl 


243 


Nspl 


28*7 


BsQl 


292 


Bsrfi 1 


319 


Taql 


319 


TthflB81 


323 


EcoRl* 


323 


Tsp509 1 


333 


BanAl 


367 


Doel 


375 


Alul 


394 


Asp7001 


394 




396 


Asull 


398 


Bpul41 


398 


BslCl 


398 


Bspll91 


398 


BstBl 


398 


Csp451 


398 


Lspl 


398 


Nsp7524V 


398 


Nspv 


398 


Sful 


398 


Taql 


398 


TthHBSl 


402 




402 


Dpall 


402 


Kxo91 


402 


Mbol 


402 


NoeU 


402 


Sau3Al 


404 


Dprxl 


412 


Mboll 


427 


Msel 


428 


Ahalll 


428 


Dxal 



428 Swal 

434 Fbal 

434 Fofcl 

435 Bell 
435 BslQl 
435 BspAl 
435 Dpcill 
435 Kzo91 
435 Mbol 
435 Ndell 
435 Sau3Al 
437 Dpnl 

440 Fbal 

441 Maelll 

442 Nlalll 
445 Rphl 
462 Nlalll 
478 Hgal 
495 Afll 



495 Asul 

495 Avail 

495 Bael81 

495 BsiZl 

495 Cfrl3l 

495 Eco47l 

495 Eco471 

495 Nlalll 

«5 Nspail 

495 NsolV 

495 Sau96l 

495 Slnl 

497 BscBl 

497 NlalV 

501 SfaKL 

506 Dsav 

506 EcoRll 

508 Apyl 

508 BsiLl 

508 BstKL 

508 BstOl 

508 Mval 

508 ScrFl 

523 EcoRl* 



523 


Fofcl 


523 


Tsp509 1 


536 


Msel 


537 


Ahalll 


- 537 


Oral 


543 


Maell 


553 


Alul 


563 


EcoRl* 


563 


Tsp509 1 
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Csp61 
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Afal 


574 


Rsal 


574 


Seal 


602 


Nlalll 


603 


BSUR1 


603 


Haelll 


603 


Pall 


610 


Bsin 


610 


Bsll 


615 




615 


MWO l 


625 


Maell 


629 


Fofcl 


636 


AclX 


656 


Mnll 


657 


BspAl 


657 


Bsm 


657 


DpciU 


657 


K2091 


657 


Mbol 



657 Mfll 

657 Noell 

6S7 Sau3Al 

657 xittll 

659 Dpnl 

665 Alvl 

665 BscBl 

665 NlalV 



667 


Acil 


668 


Alvl 


669 


Accll 


669 


Bshl236 1 


669 


BspSOl 


669 


BstOl 


669 


FnuDll 


669 


Mvnl 


669 


Thai 


673 


Basal 


673 


BspAl 


673 


Bsm 


673 


Dpnll 


673 


K2o91 


673 


Mbol 


673 


Mfll 


673 


Noell 


673 


Sau3Al 


673 


Xholl 


675 


BscBl 


675 


Dpnl 


675 


NlalV 


677 


BsaJl 


677 


Bsall 


677 


Dsav 


677 


Seel 


678 


Aqul 


678 


Aval 


678 


Bcol 


678 


BsaJl 


678 


Bsall 


678 


C£r91 


678 


DsaV 


678 


Eco881 


678 


PspAl 


678 


Seel 


678 


Xcyl 


678 


Xmal 




Ahal 


679 


Benl 


679 


Bapll 


679 


Bpall 


679 


Mspl 


679 


Nell 


679 


ScrFl 


680 


Ahal 


680 


Bcnl 


680 


Nell 


680 


ScrFl 


680 


SmaJ 


661 


Alvl 


663 


Aool 


683 


ECORI* 


683 


EcoRl 


683 


Tsp509 1 



10 



EP 0 646 646 A2 



SEQUENCE LISTING 



1) GENERAL INFORMATION: 

(i) APPLICANT: Soana, Carl 

Fargnoli. Joseph 
Bolen, Joseph B. 

(ii) TITLE OF INVENTION: PROTEIN EXPRESSION SYSTEM 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Burton Rodney 

(B) STREET: P.O. Box 4000 

(C) CITY: Princeton 

(D) STATE: New Jersev 

(E) COUNTRY: U.S.A. 
<F> ZIP: 08543-4000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1. 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Gaul, Timothy J. 

(B) REGISTRATION NUMBER: 33.111 

(C) REFERENCE/DOCKET NUMBER: DC25 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (609) 252-5901 

(B) TELEFAX: 609) 252-4526 
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■2- INFORMATION FOR SEC ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

Ik) LENGTH: 693 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: cDNA 



K> (ix) FEATURE: 

<A> NAME /KEY: CDS 
(B) LOCATION: 1..693 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG TCC CCT ATA CTA GGT TAT TGG AAA ATT AAG GGC CTT GTG CAA CCC 48 

Met Ser Pro lie Leu Gly Tyr Trp Lys lie Lvs Gly Leu Val Gin Pro 

1 5 10 15 

ACT CGA CTT CTT TTG GAA TAT CTT GAA GAA AAA TAT GAA GAG CAT TTG 56 

20 Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu 

20 25 30 

TAT GAG CGC GAT GAA GGT GAT AAA TGG CGA AAC AAA AAG TTT GAA TTG 144 

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lvs Phe Glu Leu 

35 40 45 



25 



30 



GGT TTG GAG TTT CCC AAT CTT CCT TAT TAT ATT GAT GGT GAT GTT AAA 192 
Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr lie Asp Gly Asp Val Lys 
50 55 60 

TTA ACA CAG TCT ATG GCC ATC ATA CGT TAT ATA GCT GAC AAG CAC AAC 240 

Leu Thr Gin Ser Met Ala He He Arg Tyr He Ala Asp Lys His Asn 
65 70 75 80 

ATG TTG GGT GGT TGT CCA AAA GAG CGT GCA GAG ATT TCA ATG CTT GAA 23 3 

Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu He Ser Met Leu Glu 
85 90 95 

GGA GCG GTT TTG GAT ATT AGA TAC GGT GTT TCG AGA ATT GCA TAT AGT -"«6 

Gly Ala Val Leu Asp He Arg Tyr Gly Val Ser Arg He Ala Tyr Ser 
100 105 HO 

AAA GAC TTT GAA ACT CTC AAA GTT GAT TTT CTT AGC AAG CTA CCT GAA 2 34 

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lvs Leu Pro Glu 

115 120 125 

ATG CTG AAA ATG TTC GAA GAT CGT TTA TGT CAT AAA ACA TAT TTA AAT 4 2; 

Met Leu Lvs Met Phe Glu Asp Arg Leu Cvs His Lvs Thr Tyr Leu Asn 

130 135 140 



50 



55 
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GGT GAT CAT GTA ACC CAT CCT GAC TTC ATG TTG TAT GAC GCT CTT GAT 4aC 
Gly Asd Kis Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp 
145 -50 155 16C 

GTT GTT TTA TAC ATG GAC CCA ATG TGC CTG GAT GCG TTC CCA AAA TTA 52? 
Val Val Leu Tvr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu 
165 170 175 

GTT TGT TTT AAA AAA CGT ATT GAA GCT ATC CCA CAA ATT GAT AAG TAC 57 6 

Val Cys Phe Lys Lvs Arg lie Glu Ala lie Pro Gin lie Asp Lys Tyr 
180 " 185 190 

TTG AAA TCC AGC AAG TAT ATA GCA TGG CCT TTG CAG GGC TGG CAA GCC 62 4 

Leu Lvs Ser Ser Lys Tyr He Ala Trp Pro Leu Gin Gly Trp Gin Ala 
195 200 205 

ACG TTT GGT GGT GGC GAC CAT CCT CCA AAA TCG GAT CTG GTT CCG CGT ■ f 2 

15 Thr Phe Gly Glv Glv Aso His Pro Pro Lys Ser Asp Leu Val Pro Arg 

210 215 220 

GGA TCC CCG GGA ATT CAT CGT 69 3 

Gly Ser Pro Gly He His Arg 
225 230 



TO 



20 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 
25 (B) TYPE: amino acid 

IV) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : . SEQ ID NO: 2: 



Met Ser Pro He Leu Gly Tyr Trp Lys He Lys Gly Leu Val Gin Pro 
15 10 15 

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu 
20 25 30 

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu 
35 40 45 

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr He Asp Gly Asp Val Lys 
50 55 60 

Leu Thr Gin Ser Met Ala He He Arg Tyr lie Ala Asp Lys His Asn 
65 70 75 80 



45 



50 



55 
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10 



15 



20 



25 



Met Leu 

Gly Ala 

Lys Asp 

Met Leu 
130 

Gly Asp 
145 

Val Val 
Val Cys 
Leu Lys 



Thr Phe 
210 

Gly Ser 
225 



Gly Gly Cys 
85 

Val Leu Aso 
100 

Phe Glu Thr 
115 

Lys Met Phe 



His Val Thr 

Leu Tyr Met 
165 

Phe Lys Lys 
180 

Ser Ser Lys 
195 

Gly Gly Gly 
Pro Gly lie 



Pro Lys Glu 

lie Arg Tyr 

Leu Lys Val 
120 

Glu Aso Arg 

13*5 

His Pro Asp 
150 

Asp Pro Met 
Arg lie Glu 



Tyr He Ala 
200 

Aso His Pro 

215 

His Arg 
230 



Arg Ala 
90 



Glu He 



Gly Val 
105 



Ser Arg 
Asp Phe Leu Ser 



Leu Cys 

Phe Met 

Cys Leu 
170 

Ala He 
185 

Trp Pro 
Pro Lys 



His Lys 
140 

Leu Tyr 

155 

Asp Ala 
Pro Gin 
Leu Gin 



Ser Asd 

220 



Ser Met Leu Glu 
95 

He Ala Tvr Ser 
110 

Lvs Leu Pro Glu 
125 

Thr Tyr Leu Asn 



Asp Ala Leu Asp 

160 

Phe Pro Lvs Leu 
175 

He Asp Lys Tvr 
190 

Gly Trp Gin Ala 
205 

Leu Val Pro Arg 



Claims 

1. An expression vector, comprising: 

35 < a ) a fir s* coding region, which codes for a polypeptide capable of binding to glutathione, 

operatively connected to a promoter. 

(b) a second coding region in-frame with the first coding region, and 

(c) at least one restriction site between the first and second coding regions; 

wherein a fusion protein of the first and second coding regions would result from expression of the 
40 vector. 

2. A host cell, comprising the vector of Claim 1 . 

3. A process for isolating and purifying a protein, which comprises: 

45 (a) treating the host cell of Claim 2 under conditions allowing expression of the vector, whereby a 

fusion protein of the first and second coding regions will be expressed; 

(b) exposing proteins from the host cell to glutathione resin, whereby the fusion protein will adhere to 
the resin; and 

(c) cleaving the expression product of the second coding region from the resin-bound fusion protein 

50 

4. A process for expressing a nucleic acid sequence, which comprises: 

(a) inserting the nucleic acid sequence into a baculovirus expression vector in-frame with a first 
coding region for a polypeptide capable of binding to glutathione wherein the first coding region is 
operatively linked to a promot r; 
55 (b) placing th vector into a host c II; 

(c) treating the host cell under conditions allowing expression of the vector, resulting in expression of 
a fusion protein of th first coding region and th sequence inserted in step (a); 
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(d) exposing prot ins from the host cell to glutathione resin, wher by the fusion protein will adh re to 
the resin; and 

(e) treating the adhered fusion protein with a protease to release the expression product of the 
nucleic acid sequence from the resin. 

5 

5. The expression vector of Claim 1 , wherein the promoter is a baculovirus promoter. 

6. The host cell of Claim 2, wherein the cell is a Spodoptera frugiperda cell. 

jo 7. The host cell of Claim 2, wherein the cell is a Spodoptera frugiperda cell and the expression vector 
comprises a baculovirus promoter. 

a The process of Claim 3. wherein the host cell is a Spodoptera frugiperda cell and the promoter is a 
baculovirus promoter. 

T5 

9. The process of Claim 4, wherein the host cell is a Spodoptera frugiperda cell and the promoter is a 
baculovirus promoter. 

10. The host cell of Claim 2, wherein the cell is an Sf9 cell. 

20 

11. The host cell of Claim 2, wherein the cell is an Sf9 cell and the promoter is a baculovirus promoter. 

12. The process of Claim 3, wherein the host cell is an Sf9 cell and the promoter is a baculovirus promoter. 
25 13. The process of Claim 4, wherein the host cell is an Sf9 cell and the promoter is a baculovirus promoter. 

14. The vector of Claim 1, wherein the second coding region is Lck. LynB. Syk, Blk, Fyn, or Yes. 

15. The host cell of Claim 2, wherein the second coding region is Lck, LynB, Syk, Blk, Fyn, or Yes .. 

30 

16. The process of Claim 3, wherein the second coding region is Lck, LynB, Syk. Blk, Fyn, or Yes .. 

17. The process of Claim 4, wherein the target protein is Lck protein. 

35 18. The expression vector of Claim 1, wherein the first coding region encodes glutathione-s-transf erase. 

19. The host cell of Claim 2, wherein the first coding region encodes glutathione-s-transf erase. 

20. The process of Claim 3, wherein the first coding region encodes glutathione-s-transferase. 

40 

21. The process of Claim 4, wherein the first coding region encodes glutathione-s-transferase. 



50 



55 
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